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Abstract — This paper reviews statistical methods re- 
cently developed to reconstruct and analyze dark matter 
mass maps from weak lensing observations. The field of 
weak lensing is motivated by the observations made in 
the last decades showing that the visible matter represents 
only about 4-5% of the Universe, the rest being dark. 
The Universe is now thought to be mostly composed by 
an invisible, pressureless matter -potentially relic from 
higher energy theories- called "dark matter" (20-21%) 
and by an even more mysterious term, described in 
Einstein equations as a vacuum energy density, called 
"dark energy" (70%). This "dark" Universe is not well 
described or even understood, so this point could be the 
next breakthrough in cosmology. 

Weak gravitational lensing is believed to be the most 
promising tool to understand the nature of dark matter 
and to constrain the cosmological model used to describe 
the Universe. Gravitational lensing is the process in which 
light from distant galaxies is bent by the gravity of 
intervening mass in the Universe as it travels towards 
us. This bending causes the image of background galaxies 
to appear slightly distorted and can be used to extract 
significant results for cosmology. 

Future weak lensing surveys are already planned in 
order to cover a large fraction of the sky with large 
accuracy. However this increased accuracy also places 
greater demands on the methods used to extract the 
available information. In this paper, we will first describe 
the important steps of the weak lensing processing to 
reconstruct the dark matter distribution from shear es- 
timation. Then we will discuss the problem of statistical 
estimation in order to set constraints on the cosmological 
model. We review the methods which are currently used 
especially new methods based on sparsity. 

Index Terms — Cosmology : Weak Lensing, Methods : 
Statistics, Data Analysis 

I. Introduction 

According to present observations, we believe that 
the majority of the Universe is dark, i.e. does not 
emit electromagnetic radiations. Its presence is inferred 
indirectly from its gravitational effects: on the motions 
of astronomical objects and on light propagation. 



Weak gravitational lensing has been found to be one of 
the most promising tools to probe dark matter and dark 
energy because it provides a method to map directly the 
distribution of dark matter in the Universe (see [1], [2]). 
From this dark matter distribution, the nature of dark 
matter can be better understood and better constraints 
can be set on dark energy because it affects the evolution 
of structures. This method is now widely used but, the 
amplitude of the weak lensing signal is so weak that its 
detection relies on the accuracy of the techniques used to 
analyze the data. Each step of the analysis has required 
the development of advanced techniques dedicated to 
these applications. 

This paper is organized as follow: 

Section 2 aims at giving an overview of weak gravi- 
tational lensing : the basics of the lensing theory and a 
brief description of the weak lensing data analysis. 

Section 3 will be dedicated to the presentation of the 
shear estimation problem. It requires the measurement 
of the shape of millions of galaxies with extremely 
high accuracy, in the presence of observational problems 
such as anisotropic Point Spread Function, pixelisation 
and noise. Methods that are currently used to derive 
the lensing shear field from the shapes of background 
galaxies will be described. 

Section 4 will address the following inverse problem : 
how to derive a dark matter mass map from the measured 
shear field. Because of some observational effects such 
as noise and complex survey geometry, this problem 
can be seen as an ill-posed inverse problem. We will 
present various methods of inversion currently used to 
reconstruct the dark matter mass map from incomplete 
shear map especially a recent promising method of 
interpolation, based on the sparse representation of the 
data. And we will describe the different filtering methods 
which are used to reduce the noise in these dark matter 
mass maps such as linear filters, Bayesian techniques 
and a recent wavelet method. 

Finally, in section 5, we will discuss the problem of 
statistical information extraction in weak lensing data 
in order to constrain the cosmological model. We will 
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introduce different statistics which are of interest in 
weak lensing data analysis. A recent approach based on 
sparse representations will be presented. 



II. Introduction to weak lensing 
A. Gravitational Lensing observations 




Fig. 1. Strong Gravitational Lensing effect observed in the Abell 
2218 cluster (W. Couch et al, 1975 - HST). 

In the beginning of the twentieth century, A. Ein- 
stein predicted that massive bodies could be seen as 
gravitational lenses that bend the path of light rays by 
creating a local curvature in space-time. One of the 
first confirmations of Einstein's new theory was the 
observation during the 1919 eclipse of the deflection 
of light from distant stars by the sun. Since then, a 
wide range of lensing phenomena have been detected. 
The gravitational deflection of light generated by mass 
concentrations along light paths produces magnification, 
multiplication, and distortion of images. These lensing 
effects are illustrated by Fig. [1] which shows one of the 
strongest lens observed : Abell 2218, a massive cluster 
of galaxies some 2 billion light years away towards the 
constellation Draco. The observed gravitational arcs are 
actually the magnified and distorted images of galaxies 
that are about 10 times more distant than the cluster. 

B. Gravitational lensing theory 

The properties of the gravitational lensing effect 
depend on all the projected mass density integrated 
along the line of sight and on the cosmological angular 
distances between the observer, the lens and the source 
(see Fig. [2]). 

1 ) The lens equation: In the thin lens approximation, 
we consider that the lensing effect comes from a single 
matter inhomogeneity located between the source and the 
observer. The system is then divided into three planes: 
the source plane, the lens plane and the observer plane. 
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Fig. 2. Illustration of the gravitational lensing effect by large 
scale structures: the light coming from distant galaxies (on the right) 
traveling toward the observer (on the left) is bent by the structures (in 
the middle). This bending causes the image of background galaxies 
to appear slightly distorted. The structures causing the deformations 
are called gravitational lenses by analogy with classical optics. 

The light ray is supposed to travel without deflection 
between these planes with just a slight deflection a while 
crossing the lens plane (see Fig. [3]). 

In the limit of a thin lens, all the physics of the gravita- 
tional lensing effect is contained in the lens equation that 
relates the true position of the source 0$ to its observed 
position(s) on the sky 0j: 

e s = o I - I ^a(i), (i) 

with £ = Dol^i and Dol, Dls and Dos are respec- 
tively the distance from the observer to the lens, the 
lens to the source, and the observer to the source. The 
deflexion angle a is related to the projected gravitational 
potential ^ obtained by the integration of the 3D New- 
tonian potential 3>(f) along the line of sight: 




where c is the speed of light and Vj_ is the perpendicular 
component of the gradient operator. 

We can distinguish two regimes of gravitational 
lensing. In most cases, the bending of light is small 
and the background galaxies are just slightly distorted. 
This corresponds to the weak lensing effect. Sometimes 
(as seen previously) the bending of light is so extreme, 
that the light travels along two different paths to the 
observer, and multiple images of one single source 
appear on the sky. For this to happen, the lensing effect 
must be strong. In this paper, we will only address the 
weak gravitational lensing regime. 

2) The distortion matrix: The weak gravitational 
lensing effect results in both an isotropic dilation (the 
convergence, k) and an anisotropic distortion (the shear, 
7) of the source. To quantify this effect, the lens equation 
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Fig. 3. The thin lens approximation. 



has to be solved. Assuming Oj is small, a first order 
Taylor series approximation of the distortion operator, 
given by the lens equation, can be done : 



(3) 



where i and j correspond respectively to the i th com- 
ponent in the lens plane and the j th component in the 
source plane and : 
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where Aij are the elements of the matrix A and 5ij 
is the Kronecker delta. All the first order effects (the 
convergence k and the shear 7) can be described by the 
Jacobian matrix A that is called distortion matrix: 
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(5) 



where 71 = 7cos2(/? and 72 = 7 sin 2ip. 



The convergence term k enlarges the background 
objects by increasing their size, and the shear term 7 
stretches them tangentially around the foreground mass. 

3) The gravitational shear (7): The gravitational 
shear 7 describes the anisotropic distortions of back- 
ground galaxy images. It corresponds to a two compo- 
nents field 71 and 72 that can be derived from the shape 
of observed galaxies: 71 describes the shear in the x and 
y directions and 72 describes the shear in the x = y 
and x = —y directions. Using the lens equation, the 
two shear components 71 and 72 can be related to the 



gravitational potential <I> by: 



71 



12 



(6) 



If a galaxy is initially circular with a diameter equal to 
1, the gravitational shear will change this galaxy in an 
ellipsoid with a major axis a = and a minor axis 

b = 1 _ K 1 + | 7 i ■ The eigenvalues of the amplification matrix 
(corresponding to the inverse of the distortion matrix A) 
pro- vide the elongation and the orientation produced 
on the images of lensed sources [2]. The shear 7 is 
frequently represented by a segment representing the 
amplitude and the direction of the distortion (see Fig. [4]). 

4) The convergence (k): The convergence k that 
corresponds to the isotropic distortion of background 
galaxy images is related to the trace of the distortion 
matrix A by: 



tr(A) 
tr(A) 



<5i,i + £2,2 - 
2 - A^(0 7 ) 



2(1 - k). 



(V) 



The convergence k is defined as half the Laplacian 
of the projected gravitational potential A^ and is then 
directly proportional to the projected matter density 
of the lens (see Fig. [4]). For this reason, k is often 
considered as the mass distribution. 
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Fig. 4. Simulated convergence map by [3] covering a 2° x 2° 
field with 1024 x 1024 pixels. The shear map is superimposed to 
the convergence map. The size and the direction of the segments 
represent the amplitude and the direction of the deformation locally. 
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C. Weak lensing data analysis 

The dilation and distortion of images of distant galax- 
ies are directly related to the distribution of the (dark) 
matter and to the geometry and the dynamics of the 
Universe. As a consequence, weak gravitational lensing 
offers unique possibilities for probing the statistical prop- 
erties of dark matter and dark energy in the Universe. 
In the next sections, we detail the different steps of 
the weak lensing data analysis along with the different 
techniques dedicated to these applications. The following 
sub-problems will be addressed: 

1) Shear estimation from the measurement of the 
background galaxy ellipticities. 

2) Inversion methods to derive a dark matter mass 
map from a shear map. 

3) Filtering of the dark matter mass map to reduce 
the noise level. 

4) Statistical analysis of the weak lensing data to 
constrain the cosmological model. 

The constraints that can be obtained on cosmology from 
the weak lensing effect relies strongly on the quality 
of the techniques used to analyze the data, because the 
weak lensing signal is very small. In the following, an 
overview of the different techniques currently used will 
be given along with future prospects. 

III. Shear estimation 

The weak gravitational lensing effect is so small that 
it is not possible to detect it from a single galaxy. The 
fundamental problem is that galaxies are not intrinsically 
circular, so the measured ellipticity is a combination of 
their intrinsic ellipticity e mt and the gravitational lensing 
shear 7. By assuming that the orientation of intrinsic 
ellipticities of galaxies are random, any systematic align- 
ment arises from gravitational lensing. To estimate the 
gravitational shear locally, the measurements of many 
background galaxies must thus be combined. 

From this assumption, to perform an estimation of the 
shear field, we have to correct each galaxy of the field 
for the Point Spread Function (PSF) due to instrumental 
and atmospheric effects that distort the apparent shape 
of background galaxies. A shape determination algorithm 
has then to be applied to estimate the gravitational shear 
from background galaxies. 

A. Correction of the PSF and shape measurements 

A major challenge for weak lensing is the correction 
for the PSF. Each background galaxy (S) is convolved 
by the PSF of the imagery system (H) to produce the 
image that is seen by the instrument (I obs ): 



This effect can contaminate a true lensing signal 
and even for the most modern telescopes, this effect 
is usually at least of the same order of magnitude 
as the weak gravitational lensing shear, and is often 
much larger. Then, we must calibrate the PSF using 
the images of stars. Indeed, stars present in the field 
and which correspond to point sources, provide a direct 
measurement of the PSF, and these can be used to model 
the variation of the PSF across the field by doing an 
interpolation between the points where stars appear on 
the image. 

In this section, we briefly review the different methods 
to correct for instrumental and atmospheric distortions. 
These methods are broadly distinguished in two classes. 
The first class of methods subtract the ellipticity of the 
PSF from that of each galaxy while the second class 
of methods attempt to deconvolve each galaxy from the 
PSF. 

The most used method is the KSB+ method that 
belongs to the first category. This method is the result of 
a series of successive improvements of the original KSB 
method proposed by Kaiser, Squires & Broadhurst [9]. 
The core of the method is based on the measurement of 
the ellipticity of the background galaxies. The weighted 
ellipticity of an object is denned as: 




Ql,l + Q2,2 



Qi,i — Q22 

2Ql,2 



where: 



Qi,j 



j d 2 ew(e)i(e)9 i e j 



(9) 



(10) 



fd 2 6W(9)I(6) 

are the quadrupole moments weighted by a Gaussian 
function W of scale length r estimated from the object 
size, / is the surface brightness of the object and 9 is 
the angular distance from the object center. 

The PSF correction is obtained by subtracting the 
star weighted ellipticity e* from the observed galaxy 
weighted ellipticity e° bs . The corrected galaxy ellipticity 
6i is given by: 



€i = € 



obs 



sm*\ — l. 



_ psm^psm*^ 



(11) 



I obs {9) = S(0)*H(9). 



where i=l,2 and P sm and p sm *) is the smear suscepti- 
bility tensors for the galaxy and star, that can be derived 
from higher-order moments of the images. 

This method has been used by many authors 
although different interpretations of the method have 
introduced differences between each implementations. 
One drawback of the KSB+ method is that for non- 
Gaussian PSFs, the PSF correction is defined poorly 
mathematically. 



(8) 
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In [11], the authors propose a method to account for 
realistic PSF better by convolving with an additional 
kernel to eliminate the anisotropic component of the PSF. 

The other class of methods attempt a deconvolution 
of each galaxy from the PSF. The direct deconvolution 
(of equation [8]) requires an inversion matrix and that 
becomes an ill-posed inverse problem if the matrix H 
is singular (i.e. can not be inverted). Some methods 
have been developed to correct for the PSF without a 
direct deconvolution. These methods try to reproduce 
the observed galaxies by modeling each unconvolved 
background galaxy (the background galaxy as it would 
be seen without a PSF): 



I mod (0) = S mod (e)*H*(6). 



(12) 



The modeled galaxy (S mod ) is then convolved by the 
PSF estimated from the stars present in the field (H*) 
and the galaxy model is tuned so that the convolved 
model (I mod ) reproduces the observed galaxy (I obs ). 
One major problem of these methods is that a parametric 
Gaussian shape is assumed for the PSF and depending 
on the survey the Gaussian functions are sometimes 
badly suited to represent PSF shapes. 

All the methods work by estimating for each galaxy 
an ellipticity e (after PSF correction) whose definition 
can vary between the different methods. The accuracy of 
the shear measurement method depends on the technique 
used to estimate the ellipticity of the galaxies in presence 
of observational effects such as noise, poor pixelisation, 
etc. In the KSB+ method, the ellipticity is derived from 
quadrupole moments weighted by a Gaussian function. 
This method has been used by many authors but it is not 
sufficiently accurate for future surveys. The extension 
of KSB+ to higher order moments has been done to 
allow more complex galaxy shapes. Other methods [6], 
[7], based on the "shapelet" formalism (see [8]) are 
more accurate but many shapelet coefficients are needed 
to represent a galaxy. Indeed, the basis functions of 
shapelets representation constructed from Hermite poly- 
nomials weighted by a Gaussian function are not optimal 
to represent galaxy shapes that are closer to exponential 
functions. By consequence, in presence of noise, the 
accuracy of the shear measurement method based on a 
shapelet decomposition is not optimal. 

Many other methods have been developed to address 
the global problem of the shear estimation. In preparation 
for the next generation of wide-field survey, a wide range 
of the shear estimation methods have been compared 
blindly in the Shear Testing Program (STEP) in order to 
improve the accuracy of the methods ([4], [5]). Several 



methods have achieved an accuracy of a few percents 
in the simulated STEP images. However, the accuracy 
required for future surveys will be greater (of the order 
of 0.1%) and will require new insights. A new challenge 
called GREAT08 ([10]) has been recently set outside 
the weak lensing community as an effort to spur further 
developments. 

B. Shear field estimation 

After PSF correction, a catalogue of galaxies can 
be built with the shape and position of each galaxy 
present in the field. As stated above, the measured shape 
after PSF correction e is a combination of the intrinsic 
ellipticity of the galaxy e mt and the gravitational lensing 
shear 7. Thus, the gravitational lensing shear 7 can 
only be estimated by averaging over a large number of 
galaxies. 

Usually this shape catalogue is used to characterize the 
gravitational shear statistically using correlation func- 
tions or other statistics (see section [V]). It can also be 
used to map the shear field in order to derive a dark 



matter mass map (see section IV 1. 

In practice, the shear map is obtained by pixelisation 
of the field in such a way that several background galax- 
ies fall in each pixel. The shear map is then obtained by 
averaging for each pixel the ellipticity of the galaxies 
falling into the pixel: 

N(i,j) 

e (. x k,Vk), (13) 



1 



N(i,j) 



fe=i 



where 7 corresponds to the estimated shear for the 
pixel (i, j) of the shear map, N(i,j) is the number of 
galaxies within the pixel used to estimate the local 
shear 7(2, j) and e(xk,yk) corresponds to the estimated 
ellipticity of a galaxy at position The resulting 

shear map is subject to some observational effects such 
as noise that arises both from the measurement error 
of galaxy ellipticities cr meas and the residual of galaxy 
intrinsic ellipticities a mt . Another observational effect 
is the masking out of bright stars from the field that 
gives a complex geometry to the survey. Fig. [5] shows 
an example of mask applied to real data. The analysis 
of weak lensing data requires to account for these 



observational effects (see section IV-A and section IV- 



0- 



IV. Mapping the dark matter 

The problem of mass reconstruction has become a 
central topic in weak lensing since the very first maps 
of dark matter have demonstrated that we could see the 
dark side of the Universe. 
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Fig. 5. Mask applied to the CFHTLS data (obtained with the 
Megacam in the Dl field). The mask covers a field of 1° x 1°. (J. 
Berge et al, 2008) 



The reconstruction of the dark matter mass map from 
shear measurements is an ill-posed inverse problem 
because of observational effects such as noise, complex 
geometry of the field, etc. This inverse problem can be 
decomposed in two sub-problems: the reconstruction of 
the dark matter mass map from the shear field (section 
IV-A) and the filtering of the dark matter mass map 
(section IV-B). 

A. Weak lensing inversion 

1 ) Local inversion: We first consider the local inver- 
sions that have the advantage to address two different 
problems : the problem of missing data and the finite 
size of the field. A relation between the gradient of 
K = log(l — k) and combinations of first derivatives 
of g = have been derived by [9] : 

VK = u 

9i,l + 92,2 
92,1 - 91,2 



-1 



1 



1 



\g\ 




u (14) 



This equation can be solved by line integration and there 
exists an infinite number of local inverse formulae which 
are exact for ideal data on a finite-size field. But they 
differ in their sensitivity to observational effects such as 
noise. The reason why different schemes yield different 
results can be seen by noting that the vector field u (the 



right-hand side of equation 14 1 has a rotational compo- 
nent due to noise because it comes from observational 
estimates. 

In [12], the authors have split the vector field u into a 
gradient part and a rotational part and they derive the best 
formula that minimizes the sensitivity to observational 
effects by convolving the gradient part of the vector field 
u with a given kernel. 

The local inversions reduce the unwanted boundary 
effects but whatever the formula is used, the 



reconstructed field is more noisy than that obtained 
with a global inversion. Another drawback is that the 
reconstructed dark matter mass map has still a complex 
geometry that will make the later analysis more difficult. 

2) Global inversion: A global relation between k and 
7 can be derived, from the relations Q and ([7]). Indeed, 
it has been shown by [13] that the least square estimator 
R n of the convergence k in the Fourier domain is: 



P 2 (ki, k 2 )j 2n (k\, fa), 



(15) 



where the hat symbols denotes Fourier transform and: 



Pl{kiM) 



kl 



^2 



and P 2 (k\, k 2 ) 



2fa k 



1K-2 



(16) 



with Pi{k\,k 2 ) = when k\ = k\, and P 2 (k\,k 2 ) 
when k\ = or k 2 = 0. The most important 
drawback of this method is that it requires a convolution 
of shears to be performed over the entire sky. As a 
result, if the observed shear field has a finite size or a 
complex geometry, then the method can produce artifacts 
on the reconstructed convergence distribution near the 
boundaries of the observed field. A solution that has been 
proposed by [15] to deal with missing data consists in 
filling-in judiciously the masked regions by performing 
an "inpainting" method simultaneously with a global 
inversion. Inpainting techniques are an extrapolation 
of the missing information using some priors on the 
solution. This new method uses a prior of sparsity in 
the solution introduced by [16]. It assumes that there 
exists a dictionary $ (here the Discrete Cosine Trans- 
form) where the complete data are sparse and where 
the incomplete data are less sparse. The weak lensing 
inpainting problem consists of recovering a complete 
convergence map k from the incomplete measured shear 
field 7j. The solution is obtained by minimizing: 



min ||<I> k||o subject to 



|| 7i - M(Pi * k) || 2 < a, 
(17) 

where a stands for the noise standard deviation and M 
is the binary mask (i.e. M, = 1 if we have information 
at pixel i, Mi = otherwise). This method enables to 
reconstruct a complete convergence map k that can be 
used to do statistic estimation with a good accuracy (see 
section [V]). A comparison with other probes of the matter 
distribution can also be performed. This comparison is 
usually done after a filtering of the dark matter map (see 



section |IV-B[ ) whose quality will be improved by the 
absence of missing data. 
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B. Weak lensing filtering 

The convergence map obtained by inversion of the 
shear field is very noisy even with a global inversion The 
noise comes from the shear measurement errors and the 
residual intrinsic ellipticities present in the shear maps 
that propagate during the weak lensing inversion. An 
efficient filtering is required to compare the dark matter 
distribution with other probes. 

1) Non-Bayesian methods: 

• Gaussian filter: 

The standard method [13] consists in convolving the 
noisy convergence map k with a Gaussian window 
G with standard deviation oq: 

KG = G*K n = G*Pi*Ji n + G*P 2 * 72n- (18) 

The Gaussian filter is used to suppress the high 
frequencies of the signal. However, a major problem 
is that the quality of the result depends strongly on 
the value of oq that controls the level of smoothing. 

• Wiener filter: 

An alternative to Gaussian filter is the Wiener filter 
([17], [18]) obtained by assigning the following 
weight to each fc-mode: 



w(k) 



(19) 



\k(k)\ 2 + \N(k)\ 2 

In theory, if the noise follows a Gaussian 
distribution, the Wiener filtering provides the 
minimum variance estimator. However, it is not 
the best approach, in particular on small scales 
where non-linear features deviate significantly 
from gaussianity. However, Wiener filter leads to 
reasonable results, generally better than the simple 
Gaussian filter. 

2) Bayesian methods: 

• Bayesian filters 

Some recent filters are based on the Bayesian the- 
ory that considers that some prior information can 
be used to improve the solution. Bayesian filters 
search for a solution that maximizes the a posteriori 
probability using the B ayes' theorem : 

P{n n \K)P{n) 



P(k\k % 



P(Kr 



(20) 



where : 

- P{k u \k) is the likelihood of obtaining the data 
K n given a particular convergence distribution 

K. 

- P{n n ) is the a priori probability of the data 
K n . This terms, called evidence, is simply a 



constant that ensures that the a posteriori prob- 
ability is correctly normalized. 

- P(k) is the a priori probability of the estimated 
convergence map k. This term codifies our 
expectations about the convergence distribution 
before acquisition of the data K n . 

- P(k|k„) is called a posteriori probability. 
Searching for a solution that maximizes P(k\k u ) 
is the same that searching for a solution that mini- 
mizes the following quantity (Q) : 

Q = -log(P(K\K n )), (21) 

Q = -l0g(P(K n \ K ))-\0g(P(K)). (22) 

If the noise is uncorrected and follows a Gaussian 
distribution, the likelihood term P{n n \K) can be 
written: 



P(k h \k) oc exp(-^x 2 ) 



(23) 



with 



X' 



E 

x,y 



(K n (x,y) - n(x,y)y 



(24) 



The equation (22i can be expressed as follows: 
1 



Q 



;X 2 -log(P( K )) = lx 2 



pH, (25) 



2~ 2' 
where (3 is a constant that can be seen as a parameter 
of regularization and H represents the prior that is 
added to the solution. 

If we have no expectations about the convergence 
distribution, the a priori probability P(k) is uniform 
and the maximum a posteriori is equivalent to the 
well-known maximum likelihood. This maximum 
likelihood method has been used by [19], [20] to 
reconstruct the weak lensing field, but the solution 
needs to be regularized in some way to prevent 
overfitting the data. It has been done via the a 
priori probability of the convergence distribution. 
The choice of this prior is one of the most critical 
aspects of the Bayesian analysis. An Entropic prior 
is frequently used but there exists many definitions 
of the Entropy (see [21]). One that is currently used 
is the Maximum Entropy Method (MEM) (see [22], 
[24]). 

Some authors ([19], [23]) have also suggested to 
reconstruct the gravitational potential ^ instead 
of the convergence distribution k, still using a 
Bayesian approach. But this is clearly better to 
reconstruct the mass distribution k directly because 
it allows a more straightforward evaluation of the 
uncertainties in the reconstruction. 
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• Multiscale Bayesian niters 

A multiscale maximum entropy prior has been pro- 
posed by [24] which uses the intrinsic correlation 
functions (ICF) with varying width. The multichan- 
nel MEM-ICF method consists in assuming that 
the visible-space image I is formed by a weighted 
sum of the visible-space image channels Ij, I = 
J2f=iPjIj where N c is the number of channels and 
Ij is the result of the convolution between a hidden 
image hj with a low -pass filter Cj, called ICF 
(Intrinsic Correlation Function) (i.e. Ij = Cj * Oj). 
In practice, the ICF is a Gaussian. The MEM-ICF 
constraint is: 

Hicf = Y]\oj\ -mj- | Oj | log ( L°jJ ] . (26) 

Another approach, based on the sparse representa- 
tion of the data, has been used by [25] that consists 
in replacing the standard Entropy prior by a wavelet 
based prior. Sparse representations of signals have 
received a considerable interest in recent year. The 
problem solved by the sparse representation is to 
search for the most compact representation of a 
signal in terms of linear combination of atoms in 
an overcomplete dictionary. 
The entropy is now defined as : 

J-i 

3=1 k,l 

In this approach, the information content of an 
image I is viewed as sum of information at different 
scales Wj. The function h defines the amount of 
information relative to a given wavelet coefficient. 
Several functions have been proposed for h. 
In [26], the most appropriate entropy for the weak 
lensing reconstruction problem has been found to be 
the NOISE-MS E entropy that presents a quadratic 
behavior for small coefficients and is very close to 
the l\ norm (i.e. absolute value of the wavelet coef- 
ficient) when the coefficient value is large, which is 
known to produce good results for the analysis of 
piecewise smooth images. The proposed filter called 
MRLens (Multi-Resolution for weak Lensing) has 
shown to outperform other techniques (Gaussian, 
Wiener, MEM, MEM-ICF) in the reconstruction of 
dark matter. It has been used to reconstruct the 
largest weak lensing survey ever undertaken with 
the Hubble Space Telescope. The result is shown 
Fig. [6j This map is the most precise and detailed 
dark matter mass map, covering a large enough area 
to see extended filamentary structures. 



Gravitational lensing magnification fC 




150.4 160.2 150.0 141 

Right Ascension [degrees] 



Fig. 6. Map of the dark matter distribution in the 2-square degree 
COSMOS field by [27]: the linear blue scale shows the convergence 
field k, which is proportional to the projected mass along the line of 
sight. Contours begin at 0.4 % and are spaced by 0.5% in k. 

V. COSMOLOGICAL MODEL CONSTRAINTS 

Image distortion measurements of background galax- 
ies caused by large-scale structures provides a direct 
way to study the statistical properties of the growth of 
structures in the Universe. Weak gravitational lensing 
measures the mass and can thus be directly compared to 
theoretical models of structure formation. But because, 
we have only one realization of our Universe, a statistical 
analysis is required to do the comparison. The estimation 
of the cosmological parameters from weak lensing data 
can be seen as an inverse problem. The direct problem 
that consists of deriving weak lensing data from cosmo- 
logical parameters can be solved using numerical simula- 
tions. But the inverse problem cannot be solved so easily 
because the N-body equations used by the numerical 
simulations can not be inverted. A statistical analysis is 
then required to constrain the cosmological parameters. 
The statistical characteristics of the weak lensing field 
can be quantified using a variety of measures estimated 
either in the shear field or in the convergence field. Most 
lensing studies do the statistical analysis in the shear 
field to avoid the inversion. But most of the following 
statistics can also be estimated in the convergence field 
if the missing data are carefully accounted. 

A. Second-order statistics 

The most common method for constraining cosmo- 
logical parameters uses second-order statistics of the 
shear field calculated either in real or Fourier space (or 
Spherical Harmonic space). 

The most popular Fourier space second-order statistic 
is the power spectrum P 7 because it can be easily 
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related to the theoretical 3D matter power spectrum 
P(k, \) to estimate cosmological parameters. The cor- 
relation properties are more convenient in Fourier space, 
but for surveys with complicated geometry due to the 
removal of bright stars, the spatial stationarity is not 
satisfied and the missing data need proper handling. 
Consequently, real space statistics are easier to estimate, 
although statistical error bars are harder to estimate. 
An example of real space second-order statistic is the 
shear variance < 7 2 >, defined as the variance of the 
average shear 7 evaluated in circular patches of varying 
radius 6 S . The shear variance < 7 2 > can be related 
to the underlying 3D matter power spectrum via the 2D 
convergence power spectrum P y . This shear variance has 
been used in many weak lensing analysis to constrain 
cosmological parameters. Another real space statistic is 
the shear two-point correlation function £,^(0) that is 
currently used because it is easy to implement and can 
be estimated even for complex geometry. It is defined as 
follows : 

£^(0)=<7^b# + 0)>, (28) 

where i,j = 1,2 and the averaging is done over pairs 
of galaxies separated by angle 9 = |0|. By parity 
£1,2 = £2,1 = and by isotropy £11 and £2,2 are 
functions only of 0. The shear two-point correlation 
functions can also be related to the underlying 3D 
matter power spectrum via the 2D convergence power 
spectrum P 1 . These two-point correlation functions are 
the most popular statistical tools used in weak lensing 
analysis. The variance of the aperture mass M ap 
[29] that corresponds to an average shear two-point 
correlation has been also used in many weak lensing 
analyses. This statistic is the result of the convolution 
of the shear two-point correlation with a compensated 
filter. Several forms of filters have been suggested which 
trade locality in real space with locality in Fourier space. 

Second-order statistics measure the Gaussian proper- 
ties of the field. This is a limited amount of information 
since it is known that the low redshift Universe is highly 
non-Gaussian on small scales. Indeed, gravitational clus- 
tering is a non linear process and in particular at small 
scales the mass distribution is highly non-Gaussian. 
Consequently, if only second-order statistics are used to 
set constraints on the cosmological model, degenerate 
constraints are obtained between some important cos- 
mological parameters. 

B. Higher-order statistics 

An alternative procedure is to consider higher-order 
statistics of the weak lensing shear field enabling a 



characterization of the non-Gaussian signal. 

The three-point correlation function £jj fc is the 

lowest-order statistics which can be used to detect non- 
Gaussianity. 

ZijM =< 7(01)7(02)7(03) > • (29) 

In Fourier space it is called bispectrum and only 
depends on distances |/i|, | 1 and | Z3 1 : 

B(|*i|,|l2|,|fe|) oc < 7 (N)7(|Q)7*(K3|)>{30) 

It has been shown that tighter constraints can be 
obtained with the three-point correlation function [30]. 

A simpler quantity than the three-point correlation 
function is provided by measuring the third-order mo- 
ment (skewness) of the convergence k that measures the 
asymmetry of the distribution. The convergence skew- 
ness is primarily due to rare and massive dark matter 
halos. The distribution will be more or less skewed 
positively depending on the abundance of rare and 
massive halos. We can also estimate the fourth-order 
moment (kurtosis) of the convergence that measures the 
peakedness of a distribution. A high kurtosis distribution 
has a sharper "peak" and flatter "tails", while a low 
kurtosis distribution has a more rounded peak. 

C. Other non-Gaussian statistics 

The weak lensing field is highly non-Gaussian: on 
small scales, we can observe structures like galaxies, 
groups and clusters and on larger scales, we observe 
some filament structures. Another approach to look 
for non-Gaussianity is to perform a statistical analysis 
directly on the non-Gaussian structures present in the 
convergence field. For example, the galaxy clusters that 
are the largest virialized cosmological structures in the 
Universe can provide a unique way to focus on non- 
Gaussianities present at small scales. One interesting 
statistic is the peak counting that searches the number 
of peaks detected on the convergence field corresponding 
roughly to the cluster abundance. 

D. Statistical approach based on sparsity 

It has been proposed by [14] to do a statistical analysis 
based on the sparse representation of the weak lens- 
ing data. Several representations have been compared: 
Fourier, wavelet, ridgelet and curvelet representations. 
The comparison shows that the wavelet transform is the 
most sensitive to non-Gaussian cosmological structures. 
Indeed, by minimizing the number of large coefficient, 
the wavelet transform makes statistics be more sensitive 
to the non-Gaussianities present in the weak lensing 
field. In the same paper, several non-Gaussian statistics 
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have been compared and the peak counting estimated in a 
wavelet representation, called Wavelet Peak Counting, 
has been found to be the best non-Gaussian statistic to 
constrain cosmological parameters. 

VI. Conclusion 

The weak gravitational lensing effect that is directly 
sensitive to the gravitational potential provides a unique 
method to map the dark matter. This can be used to 
set tighter constraints on cosmological models and to 
understand better the nature of dark matter and dark en- 
ergy. But the constraints derived from this weak lensing 
effect depend on the techniques used to analyze the weak 
lensing signal which is very weak. 

The field of weak gravitational lensing has recently 
seen great success in mapping the distribution of dark 
matter (Fig. [6]). But new methods are now necessary to 
reach the accuracy required by future wide-field surveys 
and ongoing efforts are done to improve the current 
analyses. This paper attempt to give an overview of the 
techniques of signal processing that are currently used 
to analysis the weak lensing signal along with future 
directions. It shows that the weak lensing is a dynamic 
research area in constant progress. 

In this paper, we have detailed the different steps of 
the weak lensing data analysis thus presenting various 
aspects of signal processing. For each problem, we have 
systematically presented a range of methods currently 
used from earliest to up-to-date methods. This paper 
shows that a milestone in weak lensing data analysis 
progress has been the introduction of Bayesian ideas 
that have provided a way to incorporate prior knowledge 
in data analysis. The next one could possibly be the 
introduction of sparsity. Indeed, we have presented new 
methods based on sparse representations of the data that 
have already had some success. 
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